AIbase

# Cross-Domain Visual Question Answering

Llama3 Mova 8b
MoVA-8B is an open-source multimodal large language model that uses a coarse-to-fine mechanism to adaptively route and fuse visual expert modules for specific tasks. It can be used for research on multimodal models and chatbots.
Multimodal Fusion Transformers
zongzhuofan
835
3
© 2025 AIbase